Data quality in ETL process: A preliminary study
نویسندگان
چکیده
منابع مشابه
Design of ETL Process on Spatio-temporal Data and Study of Quality Control
In order to use the space-time data mining technology to conduct operation research in WuLiangSuHai Eutrophication, the water quality sensor parameters of heterogeneous data which reflect the characteristics should set up a spatial data warehouse through ETL process, and water quality sensors for quality control of spatial and temporal data plays a vital role in building an effective analytical...
متن کاملPOIESIS: a Tool for Quality-aware ETL Process Redesign
We present a tool, called POIESIS, for automatic ETL process enhancement. ETL processes are essential data-centric activities in modern business intelligence environments and they need to be examined through a viewpoint that concerns their quality characteristics (e.g., data quality, performance, manageability) in the era of Big Data. POIESIS responds to this need by providing a user-centered e...
متن کاملETL and Data Quality : Which Comes First ?
Usually, an early task in any data warehousing project is a detailed examination of the source systems, including an audit of data quality. Data quality issues could include inconsistent data representation, missing data and difficulty around understanding relationships between the various source systems. As ETL and Data Quality technologies converge, it’s important to use the right tools at th...
متن کاملQuery Optimizer for the ETL Process in Data Warehouses
ETL (Extraction-Transformation-Loading) process is responsible for extracting data from several sources, cleansing, transforming, integrating and loading into a data warehouse. Extraction process accesses large amount of data by executing several complex queries in source databases. These queries are repetitive and executed at regular interval to refresh the data warehouse. Extraction of data f...
متن کاملpattern recognition in maintenance data using methodologies data minitng (cade study isfahan regional power electric company)
فعالیت های نگهداری و تعمیرات اطلاعاتی را تولید می کند که می تواند در تعیین زمان های بیکاری و ارایه یک برنامه زمان بندی شده یا تعیین هشدارهای خرابی به پرسنل نگهداری و تعمیرات کمک کند. وقتی که مقدار داده های تولید شده زیاد باشند، فهم بین متغیرها بسیار مشکل می شوند. این پایان نامه به کاربردی از داده کاوی برای کاوش پایگاه های داده چندبعدی در حوزه نگهداری و تعمیرات، برای پیدا کردن خرابی هایی که موجب...
15 صفحه اولذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Procedia Computer Science
سال: 2019
ISSN: 1877-0509
DOI: 10.1016/j.procs.2019.09.223